EXTREME VALUE THEORY FOR RANDOM WALKS ON 
HOMOGENEOUS SPACES 

MAXIM S0LUND KIRSEBOM 



Abstract. In this paper we study extreme events for random walks on homogeneous 

spaces. We consider the following three cases. On the torus we study closest returns 

of a random walk to a fixed point in the space. For a random walk on the space of unimod- 

Q \ ular lattices we study extreme values for lengths of the shortest vector in a lattice. For a 

random walk on a homogeneous space we study the maximal distance a random walk gets 

away from an arbitrary fixed point in the space. We prove an exact limiting distribution 

on the torus and upper and lower bounds for sparse subsequences of random walks in the 

two other cases. In all three settings we obtain a logarithm law. 
"xl" \ 

1. Introduction 

Let X be a probability space and G a group acting on X. Let m be a G-invariant 
probability measure on X and fix also a probability measure ji on G. We define a random 
walk on X as a sequence of random variables Xi — g^- • • g\x where the g/s have distribution 
\i and x has distribution m. Fix a function A : X — > R. The focus of our interest is the 
random variable 



43 
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CN| ' M n = max ApQ). 



0<i<n 



There exists a natural measure on the space of all random walks on X which we denote 

o ■ 



by P and define formally in section 13.11 We are particularly interested in the existence of 
sequences a n and b n such that the distribution P(M n < a n r + b n ) has a non-degenerate 
limit and if this is the case, determining the limit. We refer to such a limit as the extreme 
value distribution of the random walk. One reason why extreme value distributions are 
interesting is that they imply asymptotics for the growth of extreme values of A(X n ). In 
many cases this turns out to be a logarithm law, namely we get that almost surely 

i- A ™ r 
rS ■ hm sup — = C 

„^oo log Tl 

for some C > 0. One result of this kind is Sullivans logarithm law for geodesies on hy- 
perbolic ci-space [23]. Kleinbock and Margulis later generalised this to certain classes of 
homogeneous spaces, see [TT], and Athreya, Ghosh and Prasad proved ultrametric ana- 
logues of this result, see PQ, [2]- 

The general framework for determining extreme value distributions is known as extreme 
value theory (EVT). EVT was first applied in dynamics by Collet [10], who studied C 2 
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transformations T of an interval. He was interested in the entrance times of T^x into a 
shrinking neighborhood around a fixed point xq and to understand this, he determined 
the limiting distribution of the maximum of — logd(T- ? x, xq). Similar results to Collets 
have since been proven for other choices of T and other types of maps, see for example 
[T3] . [T7j, [18], [19]. In the context of this paper, recent results by Aytac, Freitas and 
Vaienti [3] are particularly interesting as they apply EVT to a setting involving randomness, 
more precisely, iterations of a randomly perturbed map. Freitas, Freitas and Todd have 
developed a general framework for applying EVT to dynamical systems T : X — > X, see 

nH5ME|. 

Classically, random walks were studied as objects living on R d . However, the concept of 
random walks generalizes easily to many other spaces, for example to homogeneous spaces 
with a group action which we are particularly interested in. In [12], Eskin and Margulis 
studied recurrence properties for random walks on finite volume homogeneous spaces G/Y 
where G is a semisimple Lie group and Y a nonuniform irreducible lattice. In a series 
of papers Benoist and Quint [5], [6], [7], [8], developed this theory further by studying 
stationary measures on G/Y while also generalizing their results to p-adic Lie groups. 

The main idea of this paper is to apply EVT to random walks on homogeneous spaces. 
The level of dependency among the Xj's is the deciding factor in whether EVT can suc- 
cessfully be applied to obtain limiting distributions for the maximum of A(X{). The closer 
Xi is to being an independent sequence the easier it is to apply EVT. EVT provides 
independence-like conditions that, if satisfied by Xi, imply a limiting distribution for M n . 
The idea of this paper is to verify these conditions by rewriting the joint distribution of 
the random walk using the averaging operator. The spectral gap property of the averag- 
ing operator is the crucial ingredient in showing that the independence-like conditions are 
satisfied by the random walk. 

Our main results are divided into three different settings. In the following, let S^ and 
G^ denote the semigroup and group generated by the support of /i respectively. 

1.1. Closest returns on the torus. Let X = T^ be the <i-dimensional torus with 
Lebesque measure m and Euclidian metric d. Let G = Aut(T d ) denote the group of 
linear automorphisms of T d and fix a probability measure fi on the group. We assume that 
there is no G M -invariant factor torus T of T d such that the projection of G^ on Aut(T) is 
amenable. 

We are interested in the closest returns of a random walk to a fixed point on the torus 
and in particular, how these shortest distances distribute. Let xq G X be fixed and define 

A(x) = — logd(x,x ). 

We see that for small values of d(x, x ), A(x) becomes large hence we can study the closest 
returns of X, by looking at successive maxima of A(Xj). 

Theorem 1.1. Assume that the support of fi is bounded and that det(g — I) ^ for all 
g G S^. Then for u n = r + h logn we have that for a.e. Xq G X 



lim P(M n < u r 



-dr 
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where Vd is the volume of the unit ball in R d . 

The stationary measures of random walks on the torus were recently studied by Bourgain, 
Furman, Lindenstrauss and Mozer [9]. 

The limiting distribution implies a logarithm law. That is, 

Corollary 1.2. For P-a.e. random walk and every xq G X we have 

,. ApQ 1 

hmsup — = — . 

„^oo log n a 

Actually we will see later that we only need a sufficiently good upper bound on the 
limiting distribution of M n to derive the logarithm law. 

1.2. Shortest vectors on the space of unimodular lattices. Let X = d denote the 
space of (i-dimensional unimodular lattices and let m denote the normalized Haar measure 
on Cd- Recall that Cd can be identified with SL(d, R)/SL(d, Ti) and thus can be thought of 
as a homogeneous space. Let G = SL(d, R) and fix a probability measure /i on the group. 
Assume that C7 M is non-amenable. Set 

A(A) = max fog ( — - ) . (1.1) 

*>eA\{o} \\\v\\J 

We see that this maximum will always be attained for the shortest vector in the lattice A. 
The function A plays a crucial role in connections between flows on the space of lattices 
in R d and Diophantine approximation. 
Define 

M n , a = max A(X ai ) (1.2) 

0<i<n 

Theorem 1.3. Set u n = r + ^ logn. Let w = ^sh) where Vd is the volume of the unit ball 
in R d . There exist constants w(a) G R such that w(a) —y w as a — y oo and 

e-' we ~ dr < liminf P(M nja < u n ) < limsupP(M nja < u n ) < e ~ w{ - a)e ~ dr . 

n— >oo n— i-oo 

As the reader will notice, we are not able to prove an exact limiting distribution. Instead 
we get an upper and lower bound only differing by a constant multiple which goes to zero 
as the random walk becomes infinitely sparse. The difference between this case and the 
random walk on the torus is that one of the independence-like conditions from EVT is not 
fully satisfied in this setup. It is natural to ask what additional assumptions would suffice 
to prove an exact limit. This question is answered by the following theorem. 

Theorem 1.4. Let {mj} be a sequence in N such that {mj +1 — rrij} is strictly increasing. 
Also, let a n < /3 n denote sequences in N such that a n — y oo and N n := j3 n — a n —y oo. 
Then for u n = r + ~ log N n we have 



lim P I max A (X m .) < u n ) 

n->oo \a„<j</3„ V J/ J 

where w is the constant from Theorem \1.3l 



„—we dr 
e . 
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Again we obtain a logarithm law. 
Corollary 1.5. For P-almost every random walk and every xq G X we have 

.. A(X n ) 1 

limsup — = — . 

n^oo log n a 

1.3. Maximal excursions on homogeneous spaces. Let X = G/Y where G is a simple, 
non-compact Lie group with finite center and Y a non-uniform lattice in G. Let m denote 
the normalized Haar measure on X and fix also a probability measure \x on G. Assume 
that G^ is non- amenable. 

We are interested in the maximal distance a random walk gets away from some arbitrary 
fixed point xq G X. Therefore, define 

A(x) = d(x,x ) 

where d is a Riemannian metric on X chosen by fixing a right invariant Riemannian metric 
on G which is bi-invariant with respect to a maximal compact subgroup of G. Let M„ a be 
defined as in (jl.2p . 

Theorem 1.6. There exists constants k > 0, w > andw(a) G R such that for sufficiently 
large a we have w(a) > and 

e~ we ~ kr < liminf P(M n , a < u n ) < limsupP(M n , a < u n ) < e - w{a)e ~ k \ 

n— >oo n— s-oo 

where u n = r + i logn. 

Remark 1.7. The constant fc is explicit and has been computed in [11] (Lemma 5.6). 

Again we do not obtain an exact limit and again this relates to the inability to verify 
one of the independence-like conditions from EVT. In this setting we also do not prove an 
analogue of Theorem 11.41 The reason is that we need to know exact asymptotics for the 
tail distribution function of A, a property which we call fc-SDL (see definition 13.21) . While 
this was proven in [11] for the shortest vectors on Cd, only a weaker property called fc-DL 
is known for the Riemannian distance on homogeneous spaces. 

As in the previous cases a logarithm law follows from Theorem 11.61 

Corollary 1.8. For P-almost every random walk and for all xq G X we have 

,. A(X W ) 1 

limsup — = — . 

n^oo log n k 

Here k > is again the constant from Remark \1.7\ 

This is a random walk analogue of the logarithm law Kleinbock and Margulis proved 
for geodesies. A natural question to ask is whether one could determine the extreme value 
distribution for the geodesic flow, since it would be a generalization of the logarithm law 
mentioned. One result in this direction is by Pollicott [21J. He determines the exact 
limiting distribution for the geodesic flow on SL(2, R)/SL(2, Ta). However, the proof uses 
connections between geodesies on the upper half plane and continued fractions, a connection 
that only exists for d = 2. 
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1.4. Structure of the paper. We begin by giving a short introduction to extreme value 
theory in Section [2j This introduction is short and in no way a complete overview of the 
field. However, for the reader unfamiliar with extreme value theory, the section should 
be sufficient to understand this paper without having to look elsewhere. In Section [3] we 
formally define the random walk and introduce the main tools used in the paper. In this 
section we also show how the averaging operator and its spectral gap property is used to 
prove quasi-independence for the random walk. We prove various results for the limiting 
distribution of the random walk under general assumptions. In Section H] we finalize the 
proofs of our main theorems using the results from the previous section and known results. 
For the case of the torus an additional argument is required which we give in this section 
as well. 

2. General extreme value theory 

EVT deals with determining the distributional properties of the maximum or minimum 
of a sequence of random variables /„ as n becomes large. This task is fairly simple if one 
assumes that the random variables are mutually independent. However, in many interesting 
cases we have some degree of dependence among the random variables. How much we can 
prove in the dependent case is related to how strong the dependency among the random 
variables is. 

In the following we elaborate on the basics of EVT for stationary sequences of identically 
distributed random variables. For a reference on general extreme value theory, see [20J . 

Let (X, P) be a probability space. Let fi denote some stationary, identically distributed 
sequence of random variables and let M n : = max <j< n (/i)- We use the notation 

F/o,...,/„- 1 (r) = P(/ < r, . . . , / ft _! < r) = P({/ < r} n • • ■ n {/ n _! < r}). 

Notice that -^/ ,...,/ n _i( r ) = P(M n < r). Also notice that since the fi are identically 
distributed we have Ff d (r) = Ff.{r) for all i,j G N. We denote this common distribution 
simply by F . We are concerned with the limiting distribution of M n under linear scalings 
a~ 1 {M n — b n ), where a n > and b n are sequences of real numbers. By this we mean the 
limit 

lim P I — < r 

where rGR. The sequences a n and b n , known as scaling sequences, are introduced in order 
to avoid cases of degenerate limiting distributions, a notion we explain in the following. To 
understand why degenerate cases occur, look for example at any i.i.d. stochastic process. 
In this case we easily see that 

P(M n <r)=F(rr^l 1 * ' 6 {r : F(r) = 1} 
y n ~ > y ' |o if r E {r : < F(r) < 1} 

We call this a degenerate limiting distribution and we see that such one provides us with 
little information about M n . Later in this section we discuss how to determine a n and b n , 
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but for now assume these exist such that 

P ( Mn ~ bn <r)=P(M n < a n r + b n ) -► G(r), 

where G : R — >• [0, 1] is a non-degenerate distribution function. To simplify notation set 
u n := a n r + b n . 

As mentioned, the i.i.d. case is the simplest, and in this case the limiting distribution 
of M n is known. When dealing with the dependent case, we are interested in stationary 
sequences that only exhibit little dependency. In other words, these are sequences that 
in some sense are close to being independent. This notion is formalized through two 
independence type conditions denoted D{u n ) and D'{u n ). 

Condition D{u n ). Condition D{u n ) will be said to hold for fi and u n if for any integers 
< i\ < ■ ■ ■ < i p < ji < ■ ■ ■ < j p > < n for which j\ — i p > I, we have 

F fn ,...j lpJn ,...j v K) - F fnt ...j tp {u n )F fnt ...j v (u n ) < a{n, I), 

where there exists a sequence l n s.t. a(n, l n ) — >■ as n — > oo and — — > for n — Y oo. 
Condition D'{u n ). Condition D'{u n ) will be said to hold for fi and u n if 

[?] 
lim sup n } P(/o > u n , fj > u n ) — > as q — > oo. 

It is a standard result from EVT that if a stationary sequence fi satisfies these two 
conditions, then the limiting distribution of M n is the same as if /* were an i.i.d. process. 
This is the content of the following theorem. 

Theorem 2.1 ([20J, Theorem 3.4.1.). Let u n = a n r + b n be a scaling sequence s.t. D(u n ) 
and D'(u n ) are satisfied for the stationary sequence f n . If r = r{r) is a real function such 
that nP(/ > u n ) —¥ t, then 

lim P(M n < tin) = e~ T . (2.1) 

n— >oo 

For some cases of dependent stationary sequences, either or both of Condition D{u n ) 
and D'(u n ) are not satisfied. However, it is possible to weaken these conditions and still 
salvage some information about the limiting distribution. For the purpose of this paper we 
introduce the following weakened version of Condition D'(u n ). 

Condition D',Au n ). For the stationary sequence fi, let 

[?] 
g q (r) := limsupn^P(/ > u n , fj > u n ). 

Condition D',Ju n ) will be said to hold for fi and u n if 

limsup£fg(r) < g(r), 

q—too 
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where g : K, — > R, only depends on r G R. 

Under these weakened assumptions we can prove the following theorem. 

Theorem 2.2. Let u n = a n r + b n be a scaling sequence s.t. D{u n ) and D',Au n ) are 
satisfied for the stationary sequence fi. If T\ = Ti(r) and r 2 = r 2 (r) denote real functions 
such that 



Then 



T\ < liminf nP(/ > u n ) < limsupnP(/ > u n ) < r 2 . (2.2) 



T 2 < liminf T>(M < i, \ < liTnsnin'P/'/l/f < i, \ < p a{r)-T X 



e~ T2 < liminf P(M„ < u n ) < limsupP(M„ < u n ) < e 



The proof of Theorem 12.21 is essentially similar to the proof of Theorem 12.11 Notice 
that we also made a weakening of the assumption that nP(/o > u n ) — > r. This is to 
accommodate cases where the limit cannot be determined or does not exist. 

Until now we have assumed the existence of scaling sequences a n and b n such that the 
limit of P(M n < a n r + b n ) is non-degenerate. However, such scaling sequences do not 
necessarily exist. In the case of Theorem 12.11 the assumption that nP(/ > u n ) — > t 
provides the most straightforward way to determine if suitable a n and b n exist and, if this 
is the case, what they are. Namely, we see that if the limit function r is either zero or 
infinity, then the limit in (12. ip becomes a degenerate distribution. Thus in order to obtain 
a non-degenerate limit, we must choose a n and b n such that the limit nP(fv > u n ) — > t 
is non-trivial. In specific cases writing out the expression for nP(/o > u n ) often provides 
an easy way to see how a n and b n must be chosen in order for the limit to exist and be 
non-trivial. 

Similarly for Theorem I2.2[ if r 2 = oo then we get a trivial lower bound on the lim inf of 
P(M n < u n ). So again, by looking at the expression for nP(/ > u n ) we can often see how 
a n and b n must be chosen for the upper bound on the lim sup to be less than infinity. 

3. EVT FOR RANDOM WALKS IN A GENERAL SETTING 

In this section we define random walks on a general probability space with a group 
action. In this general setting we show how the averaging operator can be used to prove 
extreme value distributions and logarithm laws for random walks. First we introduce the 
setup and define notation. 

3.1. Notation and setup. Let (X,m) denote a probability space and G a group acting 
measurably on X preserving m. Fix also some probability measure fi on G. The product 
space G xN naturally inherits the product measure /x® N which is also a probability measure. 
We define the probability space (3^, P) by 

(y,I>) :=(C7 xN x ^,/i® N ®m). 

We denote elements in G xN by g and write these as g = (gi, . . . ,g i: . . .). 

By a random walk on X generated by G we mean a sequence of the form X,i — g%- • • g\X 
where x G X has distribution m and the gj G G have distribution fi. Define the map 
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U : G xN — > G by L l (g) = gi ■ ■ ■ g\. Then for each i, L l (g)x represents the i'th position of 
the random walk along the path g starting at x. We use the convention that L°(g) = e, 
i.e. the neutral element in G. We define the sequence of random variables X{ : y — y X by 

Xi{g,x) = L\g)x. 

We see that y can be thought of as the space of all possible random walks on X . Let 
A : X — y R. We define the sequence of real random variables ^ : y — y R, by 

^g,x) = A(L\g)x). 

and define a new sequence of random variables M n : y — > R by 

M n (g,x) := max &(g,x). 

0<i<n 

It follows from G-invariance of m that £j is a stationary sequence with respect to P. 
Stationarity in particular implies that the random variables are identically distributed and 
we let F(r) denote the common distribution function of the £j. 
We denote by 

G l = G---G = { 9l --- gi : g 3 eG,\fl< 3 <i}. 

The natural probability measure on G 1 is the convolution measure defined as the push- 
forward measure of /i®* under the map L l . That is \x* 1 = /i® N ((L J ) _1 ). It is a useful 
observation that 

f{L\g)) d^{g) = f f(g)d^(g)= /••• / f \ 9i ■ ■ ■ g 1 )d / ,(g i ) ■ ■ ■ dp(&) , 

G xN JG i JG JG 

for any function / : G l — > R. 

3.1.1. Averaging operator. As previously mentioned, the so-called averaging operator plays 
a very important role in this work. Denote by A : L 2 (X,m) — y L 2 (X,m) the averaging 
operator with respect to G given by 

Af = / f(gx)dfi(g), 

JG 

where / G L 2 (X,m). We get the n'th iterate of A by straight forward calculation, this is 

A n f= [ f(g'x)dn* n (g'). 

JG n 

Since m is G-invariant we also get 

Af dm = / / f(gx)dfj,(g)dm(x)= / / f(gx)dm(x)dfx(g)= / f dm. (3.1) 

X J X JG JG J X J X 

Notice that A is linear. 
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Definition 3.1. We say that the averaging operator has spectral gap in L 2 (X, m) if there 
exists constants A G (0, 1) and Co > such that for all / G L 2 (X, m) and all n G N 



A n f- f dm 



x 



<c A n |m| 2 . (3.2) 



2 



3.1.2. Distance-like functions. We are going to introduce two types of function A that we 
are interested in. For this we need the tail distribution function of A. We define this as 

$a(z) = m({x : A(x) > z}) , z G R. 

Notice that 

P(£o > ««) = $aK). 

Definition 3.2. For k > 0, we say that A is k-DL ("Distance-Like") if it is continuous and 
satisfies 

3 Vl ,v 2 > such that v x e~ kz < $ A (z) < v 2 e~ kz , VzGR. (3.3) 

For k > 0, we say that A is fc-SDL ("Strong-Distance-Like") if it is continuous and satisfies 

3 i>i > such that $ A {z) = v x e~ kz + o(e~ hz ) as z -> oo. (3.4) 



The notion of distance-like functions was introduced in |llj . 

Throughout the paper we will make use of big O notation as well as Vinogradov symbols 
when appropriate. So for a set S and functions /, g on S we write f(s) = 0(g(s)) if 
there exists a constant c such that \f(s)\ < c\g(s)\ for all s G S. We sometimes write 
f(s) ^C g(s) meaning the same as f(s) = 0(g(s)) and we write f(s) x g(s) if f(s) <^. g(s) 
and g(s) < f(s). 

3.2. Bounds on the limiting distribution of M n . 

Theorem 3.3. Assume that A is k-DL for some k > and that A has spectral gap on 
L 2 (X,m). Set u n = r + |logn. Then for all r G R 

g-iae-*" < liminf p(M n < u n ) < limsupP(M n < u n ) < e 9 " e ~ kr , 
/or 

fl A 

0\ = yzt\ CqV2 ~ Ul ' 

where Vi, v 2 and Cq, A are i/ie constants from Definition \3.z} and \3.1\ respectively. 

Remark 3.4. We see that for A close to 1, we get 9 X > rendering the upper bound on 
the limiting distribution trivial. However, for small values of A we get 6\ < 0, hence a 
non-trivial upper bound. 
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Naturally, the strategy of the proof will be to verify the assumptions of Theorem I2.2L 
We begin by determining the correct scaling sequences a n and b n . Assume that A is a 
k-DL function. Then for all n G N 

v x e~ kUn < <&aCO < v 2 e~ kUn , 

for constants Vi,v 2 > 0. Easily, we see that 

filiminf (ne~ Un ) < liminf n^^(u n ) < lim sup n$ a C?v) < t^limsup {ne~ Un ) . 

n— s-oo n— s-oo n— i-oo n— >oo 

Since <&&{u n ) = P(£o > u n), the upper bound on (\2.2\i will be non-trivial if we can find 
sequences a n and b n such that the limit of ne~ k ^ anr+bn ' ) exists and is non-trivial. By writing 

ne -k(a n r+b n ) _ ne ka n r e kb n ^ j g eag y ^ Q gee ^^ f Qr Q ^ _ ^ an( ^ ^ _ 1 [ g n we g e £ 



lim ne fc ( a " r+b ™) = e ~ 



fcr 



Obviously for this choice of scaling sequences we also get a non-trivial lower bound. We 
formulate this conclusion as a lemma 

Lemma 3.5. Suppose A is a k-DL function and set u n = r + | logn. TTien 
■wie~ fcr < liminf nP(£ > u n ) < limsupnP(£ > u n ) < v 2 e~ kr , 

where V\,v 2 > are £/ie constants from Definition \3.2i 

Remark 3.6. It follows immediately that if A is assumed to be A;-SDL, then the lemma 
holds with the same choice of u n . 

The next lemma verifies Condition D' ,<. (u n ) under the assumptions of Theorem 13.31 

Lemma 3.7. Assume that A is k-DL for some k > and suppose A has spectral gap 
in L 2 (X,m). Set u n = r + -rlogn. Then Condition D',Ju n ) holds for & with g(r) = 

j^rc v 2 e~ kr , where v 2 and c , A are the constants from Definition ^. 2\ and \3.1\ respectively. 

Proof. We can rewrite the joint probability of £o and £j in terms of integrals of characteristic 
functions. Set W := (u n , oo), V = {x G X : A(x) G W} and Vf = {x G <Y : &(<?, x) G VT}. 
Notice that V = VJf and 

l y5 (x) = l Vo (L\g)x). 
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Then we get 

P(£o > u n ,£j > u n ) = / / l v g nv s(x) dm{x) dn m (g) 

JgxmJx ° J 

l Vo (x)l Vo (L j (g)x)dm(x)d^(g) 
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G XN J x 

l Vo (x) [ l Vo (U(g)x)d^(g)dm(x) 

x Jg xN 

l Vo (x) / l Vo {g'x)dfi* : >(g')dm(x) 

X JGi 



l Vo (x)A j (l Vo (x)) dm(x). 



X 



Set ip : = ly to get 



P(£o > u n ,Cj >u 



] ^ ""nj 



ipA 3 (ip) dm. 



x 



Recall the Cauchy-Schwartz inequality stating that \\fg\\ x < ||/|| 2 \\g\\ 2 f° r f-,9 *= L 2 (X,m). 
We proceed by estimating the difference |P(£o > u n,ij > u n) — P(£o > w n)P(£j > u n)\- 
Written in terms of integrals we have 

ip A* (ip) dm — I ip dm / ip dm 
X J X Jx 



ip I A J (ip) — I ip dm J dm 
x V Jx 



< 



x 



ip [ A 3 {ip) - / ipdm 
x 



dm 



ip ( A J (ip) - j ipdm 
x 



(3.5) 



< \\ip\\ 2 A j (*Ij)- / i)dm 
Jx 

<c v\m 2 2 . 

The Cauchy-Schwartz inequality was used to get the second last inequality while the spec- 
tral gap property of A was applied to get the final estimate. It follows that 

P(6 > u n ,£j >u n ) < ( ipdm) + c \ j \\ip\\l . 

Since ip is a characteristic function we know that 

ip 2 dm — ip dm. 
Jx Jx 

We also notice that 

%P dm = $aW- 
x 
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Using that A is fc-DL and using that u n = r + \ log n we see that 



v. 



2 



n 2 n 



P(£o > u n ,Zj > u n ) < (v 2 e~ ku ") 2 + c Xv 2 e- ku " = ^e~' 2kr + coX^e 
We do the summation from Condition D' ,Au n ) to get 



?? 



[a] [aj 

J2 P(6 > u n , & >u n )<nJ2 (^ e ~ 2kr + c X j -e~ kr ) 

3=1 3=1 \ H / 



Recall that since A G (0, 1) we have Yl'jLi ^ = t^a so w ^en we take the limsup n ^ 00 we 
get 

[-1 
q v 2 A 



g q (r) = limsupn V]P(£o > u n ,£j > u n ) < —e 2kr + -c v 2 e 

J — ^ 

Finally taking the limsup ^^ gives 



limsup5- ? (r) < - 

q— >oa J- A 



^ -kr 



c v 2 e 



So Condition D' ,Au n ) holds with g(r) = jz^c v 2 e . D 

Remark 3.8. Notice that g(r) vanishes as the spectral gap A goes to zero. 

3.2.1. Verifying Condition D(u n ). To verify Condition D(u n ) we need to rewrite the joint 
distribution function of the £j using the averaging operator. The idea is the same as the 
one we used to rewrite the joint distribution in the proof of Lemma 13771 Now we essentially 
do the same calculation in higher generality. 

Throughout the following computation let n = (ni, . . . ,n t ) denote a fixed t-tuple of 
integers where n\ < ■■• < n t . Let W = (— oo,u n ] and again use the notation Vq = 
{x G X : A(x) G W} and Vf = {x G X : &(<7, a?) G W} introduced in the proof of Lemma 
13.71 Furthermore, set 

An:={yey-. U(v) e W, . . . ,6^) e W} . 
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Using this notation we rewrite the joint distribution function of £ ni , . . . ,£„ t in terms of 
integrals of characteristic functions. 

P(An)= / l An d(^®m) 

JG xN xX 

t 

l[l vl (x)dm(x)dfi^(g) 

t 

l[l V() (L n >(g)x)dm(x)d^(g) 

i=l 

[ [ f[^(L n >(g)x)d^(g)dm(x), 
Jx Jg* n jJ[ 



G xN J x , =l 



G xN J X i=1 



(3.6) 



where again ip := ly . It is practical to introduce the notation gua = g%- • • gj for % > j. 
We now look at the integral with respect to /i® N in ( 13. 6p . We can rewrite this integral 
using the averaging operator in the following way. First we write 

/ ]\ij(L^(g)x)d^(g)= / ••• / \{^gn i ---gix)d i x{g nt )---d i x{g 1 ). (3.7) 
Jg™ f = \ J G J G f£ 

Now, on the right hand side of (J3.7)) . look only at the integrals with respect to g nt , . . . , g nt -i+i- 
We get 



/ " / Yl^idn, ■ ■ ■ 9ix) dfi(g nt ) ■ ■ ■ dfi(g nt _ 1+1 ) 
Jg Jg l=1 

t-i „ 

= T\^(j9ln i ,l] x ) / ^{g{nt,rH^^l\g{n t ^l]X)d^ nt ~ nt - r \g [rH>nt _ 

" Jg"' - "'- 1 

t-i 

= n^^aF)^"™ 4 - 1 ^^.,,^). 
j=i 

Inserting this in (13. 7p we get 

t 
H^(L ni (g)x)dii m (g) 

t-i 



t-1 
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We repeat this step by looking at the integrals in (13. 7p with respect to g nt _ x , ■ ■ ■ , <7n*_ 2 +i- 
These integrals, rewritten in terms of the averaging operator as done above, become 

t-2 
8=1 

Again we can insert this in (13. 7p and repeat the procedure. Doing this t times eventually 
gives that the integral with respect to /i® N in ( 13. 6 p is 

A ni (iP{x)A n2 - ni (if)(x) . . . A 11 *-" 1 *- 1 {if)(x))) • • • ) • 

By integrating again with respect to m and applying (13.11) we finally get 

P(Afi) = f A ni (4>(x)A n2 - ni (ip(x) . . . A^-^- 1 {ip{x))) ...) dm{x) 
J x 

tfj{x)A n2 ~ ni (if)(x) . . . A"*-* 1 *- 1 {tp{x)) ...) dm{x). 



x 

We can simplify notation by defining the following sequence of operators. For the sequence 
n = (ni, . . . , rit) and the fixed function if) := ly Q we define E % n : L°°{X 1 m) — > L°°(^Y,m) 
recursively by 

E l M = <P, 

where if G L°°(X,m). Notice that Ei is linear since A is linear. Using this notation and 
setting ip = if) we get 



P(A fi )= / El{if))dm. (3.8) 

J x 

Having rewritten the joint distribution, we proceed by demonstrating how to apply the 
spectral gap property of the averaging operator. More explicitly, we look at how we can 
split (13.81) into a product of two integrals at the cost of an error term when A has spectral 
gap. Let n = (n l5 . . . , n p , n p+1 , . . . , n t ) and also set q = (ni, . . . , n p ) and s = (n p+ i, . . . , n t ). 
Again assume that ri\ < ■ ■ ■ < n p < n p+1 < ■ ■ ■ < n t . We want to estimate the difference 

|P(A ft )-P(A g -)P(A s -)|. 

Written as integrals this is 

El(ip) dm- J E?(if)) dm [ E*T P (if)) dm 
x J x Jx 

Notice that 

E t n (v)=E p JipA n »^(Ei- p (v))). 
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Assume now that A has spectral gap in L 2 (X,m). Let a := E s ~ p {if)). Using the linearity 
of Eq we get 



x 



El(ip)dm- / E\{if))dm / Ef p (ip) dm 



x 



x 



< 



J E P (V>,4 np+1 ~ np (o-)) dm - / E\ (ip) ( J a dm) dm 
I E\ (ipA n P +1 - n *(a)) dm- f E\ (ip f a dm] dm 
I E\ (ip ( A n " +1 - n "(cr) - I a dm)) dm 
f E P (if) ( A n ^ +1 - n "(a) - f a dm 
E P (if) [A n ^- n v((j) - 



dm 



a dm 



x 



(3.9) 



Continuing the calculation, set p := A Up+1 Tlp (a) — j x adm. Also, recall the Holder in- 
equality for L°° functions, that is, for /el 1 and g G L°° we have 



Il/0lli<ll/llill0 



1 nyiioo 



We then get 



H^K^p)^ = \\iJ>A na - ni (...il>A n *-**- 1 {iJ)p))\\ 1 

< ||VHoo||^ na ~ ni (M n8 ~ r,3 (...'0^ n '- r, »'- 1 (#))|| 1 



< 



\if)A n3 ~ n2 {. . . ^ np_np - 1 (V'P))|| 1 



< 
< 



<IM 



p-i 

oo 

V II II 

LWpWi 



Here we alternated between using the Holder inequality to split into products of norms 
and equation (13. ip to get rid of the averaging operator. The last inequality holds since if) 
is a characteristic function on a probability space. We can now continue the calculation in 



16 MAXIM S0LUND KIRSEBOM 

(13. 9p by applying the spectral gap property of A: 



\mi>p)\\i 



El 



iP[A n P +1 - np (a)- I a dm 
x 



< 



A n " +1 - n "(a) 



a dm 



x 



< coA np+1_np \\a\\ 2 

< c X np+1 - np , 

since it is easily seen that ||er|| 2 < 1. All together we have shown that 

|P(A«) - P(Ag)P(A s )| = O (\ n ^- n ?) . (3.10) 

Lemma 3.9. If A has spectral gap on L 2 (X,m) then Condition D(u n ) holds for the se- 
quence & for any choice of scaling sequence u n and any choice of A. 

Proof. Set W = (— oo, u n ] such that for n = (m, . . . , n t ) we have 

P(Aft) = P(£ ni <u n ,...,£ nt < u n ). 

We rewrite the distribution function using the averaging operator as demonstrated earlier. 
Let 1 < n\ < ■ ■ ■ < n p < n p+ i < ■ ■ ■ < n t < nn be integers such that n p+1 — n p > I. Set 
n = (ni, . . . , rip, n p+ i, . . . , n t ), q — (m, . . . , n p ) and s = (n p +i, . . . , n t ). By definition 






P(Cm <r,...,£ np <r, £ np+1 <r...,£ m < 



which means that 



can be written as 



F £ ni ,...,£ np ,£ np+1 ,...,£ nt (u n ) -i^^K)^,.^^ 



|P(A fl ) - P(A 9 -)P(A S 



(3.11) 



which by (I3.10P is bounded by O (A" p+1 Up ). Since A np+1 n " -)■ for any sequence Z n ->■ oo 
satisfying l n < n p+ i —n p , we conclude that Condition D{u n ) holds for any choice of u n and 

A. □ 



We can now conclude on the proof of Theorem 13.31 In Lemma 13.51 we determined that 
the inequalities in ( 12. 2 p are non-trivial for the scaling sequence u n = r + -rlogn and in 
Lemma I3T71 we proved that Condition D' ,Au n ) is satisfied for & with g(r) = -^\ c o v 2 — Vi- 
lli Lemma f3T9l we proved that Condition D(u n ) is satisfied for £j for any choice of u n and A. 
This means that all assumptions of Theorem 12.21 are satisfied and so Theorem 13.31 follows 
from Theorem 12.21 



Corollary 3.10. For any a G N, Theorem lff.51 holds with M n substituted by M n ^ a 
max <i <n £ ai and 

A a 



1- \ a 



c v 2 - Vx 



(3.12) 
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Proof. Set rji = £ a j and fix a G N. First notice that £j being stationary implies that rji is 
stationary. This also means that the common distribution of £j and rji is the same and so 
nothing is changed in the proof of Lemma 13.51 The appropriate scaling sequence for rji is 
therefore also u n = r + | log n. 

In Lemma 13.71 replace j by aj throughout the proof to obtain 

°° \ a 

g{r) = v 2 e~ kr Y,^ = Y^ CoV26 ~ kr - 
j=i 

In Lemma T3.9[ equation (13. lip is bounded above by A np+1 ~ np . The equivalent equation for 
rji is bounded by \ a ( n p+i~ n p) an d so Condition D{u n ) holds as well. Again all assumptions 
of Theorem 12.21 are satisfied and so the corollary follows from Theorem 12.21 □ 

3.3. Proving Theorem 11.41 in the general setting. 

Theorem 3.11. Assume that A is k-SDL for some k > and that A has spectral gap on 
L 2 (X,m). Let {mj} be a subsequence in N such that {wij+i — rrij} is strictly increasing. 
Also, let a n < (3 n denote sequences in N such that a n — > oo and N n := (3 n — a n — > oo. 
Then for u n — r + \ log N n we have 

lim P f max (f m ) < u n ] = e~ Vie ~ k \ 

7i->oo \a n <j</3 n J 

where V\> is the constant from Definition \3.S[ 

We first prove a lemma. 
Lemma 3.12. Suppose A is k-SDL for some k > and let u n = r + | logn. Then 

\imP(^<u n ) n = e- v ^- kr . 

n— >oo 

Proof. Notice that P(£o < u n ) = 1 — <&&(u n )- Using that A is /c-SDL and u n = r + r logn 
we get 

(1 - $ A K)) n = (1 - {n- l Vl e~ kr ) - o{n- l )) n 

— e nlog(l-(n^ 1 t)ie~ fer )-o(n" 1 )) /g -^g\ 

We approximate log (l — {n~ l v \e" kr ) — o{n~ 1 )^} by its second order Taylor expansion around 
to get 

log (1 - (rrVe- fer ) - o{n- 1 )) = -nT^e'^ + o^" 1 ). 

Inserting this in (13.131) we get 

(l-$ A (wn)) n = e wie " fcr+ °W. 
Taking limits gives 

lim(l-$ A K)r = e-^ e " fer . 

D 
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Proof of Theorem \3.11[ Let W = (—00, u n ] and fn{n) = (m an ,m an +i, . . . ,77i ( g n _i). Using 
the notation from section 13.2.11 we get that 

P I max (£ m ) < M n I = P(£ mQn <U n ,..., Cm^_! < «n) = P(Am(„)). 

\a„<j<p„ y 

Set g = (m an ) and s = (m an+ i, ... ,mp n -i). Recall that ip — ly - ^ then follows from 
(13~T0|) that 

P(Am(„)) = (f *pdm\ P(A S ) + O (A^n+i— ) . (3.14) 

Now set gi = (m an+1 ) and Si = (m an+2 , . . . , m^-i). We then apply (I3.10P again to get 

P(A-) = ( f tpdrn) P(A S -J + O (A m ^ +2 - mQ " +1 ) . 
Inserting this in (13.141) while using that j x ip dm < 1 gives 

P(A m(rl) ) = ( / ^rfm j P(A Sl ) + O (\™«n+2-m an+1 + X m an+1 -m an ^ _ 

Repeating this process (3 n — a n times eventually gives 

P(A„- t(n) ) =U ijdmj +0 [J2 ^ +1 ~ mi ] • (3-15) 

Recall the notation N n = /3 n — a n and notice that j x ipdm = P(£o < ^«)- Since A" n — >■ 00 
for n — > 00 it follows from Lemma 13.121 that 

lim I if) dm) = e~ Vie kr . 

Also, as A G (0, 1) and {m i+1 — mj} is strictly increasing, we see that 

/3„-2 

J2 (a™^ 1 -^) = 0(X ma " +1 - ma ") -» 0, for n ->■ 00. 

So taking limits in (" 13. 15[) gives 

lim P(A™ (n) ) = e-^ 6 "". 

3.4. Logarithm law for random walks. 



U 



Corollary 3.13. Assume that A is k-DL for some k > and that A has spectral gap on 
L 2 (X,m). Then for P -a. e. y^y we have 

hmsup = — . 

n^oo log n k 
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Proof. We prove limsup^^ f^J < ~ and limsup,,^ |gj > ~ for P-a.e. y ey. 

The proof of the upper bound is an application of the classical Borel-Cantelli Lemma. 
For completeness we give the proof. Recall the Borel-Cantelli Lemma stating that for any 
sequence A n C y we have that 

oo 

y^ P(A n ) < oo =^ P({y ey : y E A n ior infinitely many n}) = 0. 

n=l 

Let £ > be given. We look at the sequence of sets 

K. = Ug, x) : £„(£, x) > ( r + e j log n 
Since £ n is stationary we have that 

P(A n )=pU n > Q + e jiogTi 

= P (o> t^ + ejlogn 

m ( x : A (a;) > I — + £ ] logn 

*a ( ( j,+e ) log// ) . 



Since A is &;-DL we get 






l+fce 



where v 2 > is the constant from Definition 13.21 So ^]^ =1 P(i4„) < t> 2 X^Li w il fcE < 00 
implying that for P-a.e. y G y, the inequality 

£n(l/) > [l + £ ) h & n 

only holds true for finitely many n. So by taking the lim sup^^^ and dividing by log n we 
get 

,. UV) <r 1 
lim sup < — + £. 

n^oo log n fc 

Since this holds true for every e > we have proved the desired inequality for P-a.e. y G y. 
We now prove the lower bound. Assume for contradiction that the lower bound does not 
hold, i.e. assume that there exists e > such that 

P I lim sup - — — < - — £ ) > £. 

V n->oo logn k J 
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Let 

B — \ lim sup - — — < - — e 

i n->oo logn k 

For each y G B we can find sufficiently large Uq G N such that 

sup - — — < 

i>n log j fc 2 

This implies that 

t 



nn>l ^ 



SUP 1 -^- < y - - 
j>n Q log J fc 2 



Since P(-B) > e there must be some n% G N for which 



SEi^J £ |-|.H>* (3-16) 



for some 5 > 0. For any ri2>ni and any a G N we have 



(5 < P ( max -2i- < - - - < p max — =*— < - - - 

ni<j<n 2 log J fc 2/ \n 1 <j<n 2 log n 2 fc 2 



max (£ oi ) < ( - 
V*<3<P?] V* 



< P ( max n (£ oi ) < [ - - - 1 logn 2 I . (3.17) 

We now apply Corollary 13.101 with the goal of obtaining the opposite inequality. For any 
a G N we have 



lim sup P I max (£ Q ,) < r + — logn ) < e 

0<j<n k 



y x e 



-kr 



where 9\ = -^^CqV2 — V\. For simplicity we make a change of variables. Set r = ^logs 
where s G (0, oo). Then 



lim sup P I max ((„,■) < — log(sn) I < e 9xS 
rwoo \o<j<n k J 



Pick a G N sufficiently large to ensure that 6\ < 0. Let 5 > be as in (13.161) . Then for 
s > sufficiently small we get that e 9xS < ~. Also by picking n G N sufficiently large we 
get 

p fc«-^^ io «( s "0 <<; ^ ,+ l <A 

Since £ a j is stationary, we see that 

1 

ni<j<ni+n fc 



P ( max (£ oi ) < r log(sn) ) < 5. 
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Since (13. 17j) holds for any n 2 > n\ we can set n 2 : = a{n\ + n). Inserting this in A3. 1T|) gives 
P ( max (f oi ) < I - - £ - J log(m + n) j ><5. 

yni<j<ni+n V fc zy y 

Set n3 := ni +n. It is a simple calculation to show that if we choose n large enough we get 

k " 2/ log ^ 3 ^ < ^ lo s( sn )- 
This inequality implies the following sequence of inequalities, 



nr<f<n 3 — - V l - I] l0g(n3) ) - P U^J^ ^ £ 



8 < P ( max (£y) < - - - log(n 3 ) < P max (^) < r log(sn) ) < 5, 



which is a contradiction. 



□ 



4. Proofs of main results 

At this stage we are almost done with the proofs of the main results concerning maximal 
excursions and shortest vectors. The only part that remains is to combine the results of 
the previous section with known results from other papers. 

For the closest returns on the torus we still need some additional arguments specific to 
this setup. 

4.0.1. Proofs of main results for shortest vectors on the space of unimodular lattices. In the 
setup of Subsection [L2] it was proven by Kleinbock and Margulis [TT] (Proposition 7.1) that 
A (a;) as defined in (11.11) is d-SDL. In the proof of the same proposition the explicit value 
of the constant w is derived as well. Furthermore, we know from Shalom [22] (Theorem 
C) that in the same setup the averaging operator has spectral gap in L 2 . Notice that the 
theorem applies to Cd since we can identify the space with SL(<i, IR)/SL(<i, %). So Theorem 
11.31 Theorem 11.41 and Corollary [L5] follow from Corollary 13. 101 Theorem 13.111 and Corollary 
13.131 respectively Using the ci-SDL property of A and (13.121) we see that 



w{a) = — ^ a c w - w, 



so indeed, w(a) — > w for a — > oo. 



4.0.2. Proofs of main results for maximal excursions on homogeneous spaces. In the setup 
of Subsection 11.31 it was also proven by Kleinbock and Margulis [11] (Proposition 5.1) that 
A(x) = d(x,x ) is a /c-DL function for some k > 0. The spectral gap property of the 
averaging operator in L 2 in this setup also follows from Shalom [22] (Theorem C). So 
Theorem 11.61 and Corollary 11.81 follow from Corollary 13.101 and Corollary 13.131 respectively. 
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4.1. Proofs of main results for closest returns on the torus. We recall the setup 
of Theorem 11.11 Let X = T d equipped with Lebesque measure m and Euclidian metric 
d. Also, let G = Aut(T d ) equipped with a probability measure /i. Assume that there is 
no G^-invariant factor torus T such that the projection of G M on Aut(T) is amenable. We 
know from Bekka and Guivarc'h [4] (Theorem 5) that the averaging operator has spectral 
gap in L 2 (X,m). 

Let xq G X be a fixed point and set A(x) = — logd(a;, Xq)- The random variables & are 
then given by 

&(g,x) = -\ogd(L l (g)x,x ). 



The strategy for proving Theorem 11.11 is to verify the assumptions of Theorem 12.11 Notice 
that Lemma 13.91 verifies Condition D(u n ) for & with any choice of u n and A. This means 
that we are left with the task of determining the scaling sequence u n such that the limit 
of nP(£o > u n ) is non-trivial and, for this u n , verifing Condition D'(u n ). 

First we determine u n . Let B r (x ) C X denote the ball of radius r at x and V d the 
volume of the unit ball in K, d . Then 

nP(£o > Un) = nm({-logd(x,x ) > u n }) 
= nm (B e -u n (xo)) . 

As in the case of Lemma [3.51 we set u n = r + k logn. Since X is locally Euclidian we get 
that for sufficiently large n, 



-dr 



nm(B e -u n (x Q )) = nV d e dUn = V d e~ 
and taking limits we get 

lim nP(£o > u n ) = V d e~ dr . 

n— >oo 

Again, we collect this conclusion in a lemma. 

Lemma 4.1. Set u n = r + ^logn. Then for the stationary sequence £j we have 

lim nP(e > u n ) = V d e- dr . 

n— >oo 

Having determined u n , we proceed to verify Condition D'(u n ). This is the step which 
requires the most work. Fix 5 G (0, 1). Recall the Hardy-Littlewood maximal operator M, 
which for a function / : X — >■ C is given by 

Mf(x):= sup / \f(y)\dm(y). 

i?e(o,«5) m(J3 R (x)) J Br (x) 

The Hardy-Littlewood maximal inequality then states that for any / G L X (X ) we have 

m({x:Mf(x)>(3}) = 0(p- 1 \\f\\ 1 ) 
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for every (3 > 0. This version of the Hardy-Littlewood maximal inequality for functions on 
T d follows easily from the classical version for functions on R d . Let 

E i= Ug,x) : d(L\g)x,x) < - 

The next lemma gives sufficient assumptions for Condition D'(u n ) to hold. 

Lemma 4.2. Suppose that for constants a G (0, d) and k > we have that for all s > 0, 

J2 PM = O (s~ K ) . (4.1) 

Then Condition D'(u n ) holds for £$ and u n (r) = r + i logn /or a.e. io£^. 
Proof. Using that P = /i® N ®m we can rewrite the estimate as 

Y,J m(L G * : d(L^)x,a;) < H) dji**® < £. (4.2) 

Define the function $ s : A 1 -)■ R by 

[« a ] . 

*•(*) = E/ GXN 1 {. 6 *:d(L«»),^<l}(*)^* N (y), 

and apply the Hardy-Littlewood maximal operator to \l/ s to get 

MV s (x) = sup — / * s (y) dm(j/). 

H6(o,5) m{B R {x)) J B „( X ) 



Set M\l/ S (x) := Ai s (x). Using (14. 2 p and the Hardy-Littlewood maximal inequality we get 
that for every /3 > 



/?; 



({x : M s (x) >/?}) = O (/T 1 II^IIJ = O (p-'s-*) 



Let £ > 0. Set 7 = i±p and notice that 7/c — £ = l + e>l. Let n be an integer and 
substitute s with n 7 and set /3 = n~ £ . Then 

m {{x : /Vl n7 (x) > n- £ }) = O {n £ -^) . 

Since e — 7K < — 1 we see that 

00 / 00 \ 

J2m({x: M n n(x) >n- £ }) =0 [Y, 71 ^} < °°- 

n=l \n=l / 

The classical Borel-Cantelli Lemma then tells us that for a.e. xq G X 

Xq <£ limsup {x : M. n i{x) > n _e } . 
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So there exists a number iV(x ) such that for all n > N(xq) we have M. n i{xo) < n~ £ . That 

is 

M n ~t(xo) = sup — — — / ^ n7 (x) <im(x) < 7iT £ . 

Ke(o,<5) m{B R {x )) Jb r (x ) 

Choose n so large that ^ G (0, 5) and set i? = -~. Then 



/ 



B -,{x ) 



* n ,(x)dm(x)< n£+id . 



We want to switch back to the real variable s instead of the integer variable n while 
preserving the inequality above. Let s G (n, n + 1). On the right hand side of the inequality 
we can clearly substitute n with s — 1 and the inequality will still hold. The left hand side 
written out is 



E 



J ^m (B n -,(x ) H jx : d(L^)x,x) < -L j) ^ N (^). 



We see that by changing n to s inside the integral, the measure of the intersection becomes 
smaller. However, to ensure that we are not summing over more terms we need to change 
n to s — 1 in the upper limit of the sum. All together we get 

[(S ~ 1)7Q] r / r n\ v 

? J G ,, m { Bs "' {xo) n r : d(L ^ )x,x) < ^}J d ^ N ® - (g -iV^ - (43) 

We aim to connect the left hand side of (14 .3p with the sum in Condition D'(u n ). To do 
this we derive as follows using the triangle inequality for the inclusion: 

{x G X : £ > Vm,£i > u n } 

x G X : — logd(x,x ) > r + — logn, — \ogd(U(g)x,x ) > r + — logn 

a a 

er T ■ _ er r 

x G X : d(x,x ) < — r, d(L l (g)x,x Q ) < — 
rid rid 

2e~ r • _ 2e~ r 

C <{ x G X : d(x,x ) < — — , d(L l (g)x, x) < — — 

n<i rid 

x G X : d(x,x ) < p d(L*(g)x,x) < — 



EXTREME VALUE THEORY FOR RANDOM WALKS ON HOMOGENEOUS SPACES 25 

where / := ( ^pp j . Notice that the last line is exactly the set inside the integral in H4.3[) 
above with s substituted by I. Using this gives 

[(i-i)-"*] 

22 P (£o > U n ,& > Un) 
i=l 

V!-W°] , , 1 1 x 



< 



i=l 

v d 



(I - 1)7*+* 

frd+e ' 

In the last line above we replaced I — 1 with I for notational simplicity. We can do this 
since we are only interested in the behavior as n — > oo. Inserting the expression for / gives 

- = O In -> d 



hence 

[(Z_l)7t*l 






^ P(6>Wn,6>«n) = C>(n-^). (4.4) 



Since (I — 1) 7 ° = 0(n<i) and § < 1, we see that for sufficiently large n, [(/ — 1) 7Q ] < 

for any q e N. This means that the left hand side of (j4.4p does not necessarily account for 
the entire quantity that we need to estimate to verify Condition D'(u n ). To obtain this we 
need to add 

[?] 
n ^ p (£,o>u n ,£i>u n ) 

i=[(J-l)7«]+l 

to the left hand side of equation ( 14. 4p . To find an upper bound on this sum we apply the 
averaging operator exactly like in Lemma 13.71 This gives 



J2 P(& > Tin, & > U n ) < J2 ( P (£° > M «) 2 + P (^ > M «)C(A 1 )) 

i=[(l-l)-y<*]+l i=[(J-l)7"]+i 
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where A G (0, 1) comes from the spectral gap property of the averaging operator. From the 
proof of Lemma F4.ll we see that P(£o > u n) — -Vd,e~ dr . Inserting this gives 

[?] 

^2 p (£o>u n ,£i>u n ) 

i=[(J-l)7«]+l 

r n 1 

In . . . \ / I .,.. \ 

< 



( [^] " [(/ " 1)7 ° ] ) (l Vde ~ dr ) + l Vde ~ dr £ 0(AJ) 



n / 1 



j=[(/_l)T«]+l 

2 



< Z ( -V d e- dr ) + -V d e~ dr O{\^ l - l ^ +l ) 
q \n J n 

= o(— + -\^ l - l ^ +l 

\qn n 
Consequently, 

[?] 

i=[(Z-l)7<*]+l 

Adding this to (I4.4p we get 



. 9 J 

;/ 



Taking the lim sup for n — > oo gives 

[?] 
limsupn^P(£ > M n ,^ > u n ) = O (q^ 1 ) , 

and finally by letting q — > oo we obtain 

[f] 
lim sup n VJ P(£o > w n , £j > w n ) — > for g — > oo. 

n— >-oo -. 

We conclude that Condition D'{u n ) has been established. □ 

In the following set Q := supp(yu). To complete the proof of Condition D'(u n ) we need 
to show that the estimate in ( 14. ip holds for the setup of Theorem 11.11 This is the content 
of the next lemma. 

Lemma 4.3. Assume that there exists T > 1 such that \\u\\ < T for all u> G Q. Assume 
also that det(w — I) ^ for all weS^. Let a < d. Then there exists k > such that for 
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all S > 

5>(i*)=0 (*-"). (4.5) 

2=1 

Proof. The strategy of the proof is to derive two different upper bounds on 

P(Ei) = P ( Uu,x) G ft xN x X : d(2/(w)a;,a0 < - 

using two different methods. One method generates a bound that is good for small values 
of i while the other method gives a good bound for large values of %. Using the two in 
combination gives the upper bound in (j4.5p . 

4.1.1. Method 1. Let u G ft xN and for notational simplicity set 

L\uj) := w G ft*. 
For s > we look at 

E% := Ix G X : ux G Si (a;) 

A point x E X can be written as a; = y + 7L d for some |/ G [0, l] d . Multiplication by u gives 

ujx = ujy + Z d . 
Assume that a; G E%. Then 

wy G y + Si + Z d , (4.6) 

where Si is the ball of radius - at in K, d . We see that 

m (lx G X : ux G Si(x) }) = wo/^ (jy G [0, l] d : wy G y + Si + Z d \) . 
Rearranging (14. 6 p we get 

where we used that det(a> — I) ^ 0. So 

m (jx G * : oox G Si (a;)}) = m ( (w - /) _1 Si + (w - i) _1 Z d /z d ) . 

We see that (w — I)~ lr L d can at most have finitely many points in [0, l] d so the measure 
must be bounded from above by a scalar multiple of m (w - J) _1 Si J . To estimate the 
measure we first see that 

m ((u - I^Bi) = — — — m fSi V 

Since det(w — I) ^ and uj has integer entries we see that |det(o; — I)\ > 1. Then 

m {{uj - iy l Bi\ < m (bA = O (s' 4 ) . 
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To find an upper bound on the number of copies of (u — I)~ l Bi in [0, l] d , first notice that 



(u-I)- 1 



A. 



-T^jl?. 



det(w - I) 
where A is some integer matrix. This implies that 

v ; ' det(w-J) 

So the integer lattice 2£ d will at most be contracted by the factor det(w — I) in all d 
directions. This means that 

# ((w - I)- 1 ^/^) < |det(w - /)| d . 

By assumption ||w|| < T for all ui G £1 So for a; G O l it follows simply by multiplying 
matrices that ||o;|| < (dT) % . Set T = dT. By definition of the determinant we then see that 
det(w -I) < 0{f di ) for all u G Q\ Consequently 

# ((w - I)~ 1 'L d l'L d ) < 0{f dl ) d = 0{f dH ). 
Multiplying the number of sets by the measure of each set we get 



m(K 



in 



Irnd 



{uj-iy x Bx_ + {uj-I)- x 7L 



T 



o 



Finally, as the upper bound is independent of u = L*(uj), integrating over f2 xN is trivial 
and so 



P(Ei 



in 



x G X : d(L l (u)x,x) < - \ I d\x 



:N, 



W 



O 



T 



d 2 i 



/QXN \ I S 

4.1.2. Method 2. Let again w G fi xN and L 1 {uj) = u. Again, for s > we look at the set 

E w « = [x G X : raG 5i(i) 



The idea of how to estimate its measure is to find a set, which contains Eg, and whose 
measure is easier to compute. Think of X as the ci-cube [0, l] d and partition this into 
sub-cubes of the form 

d 

X 



fc=l 



jk jfc + 1 

s s 



where j^ G {0, . . . , s — 1}. Clearly, as each vector j = (j ± , . . . ,j d ) G {0, . . . , s — 1} := J 
uniquely determines one such sub-cube, we have s d cubes of volume (-) in the partition. 
Let Cj denote the cube corresponding to the vector j. Clearly 

< x G X : tux G Bi(x) ? = I) < x : x G Cj, cua; G Bi(x) ? . 



EXTREME VALUE THEORY FOR RANDOM WALKS ON HOMOGENEOUS SPACES 



29 



Notice that this is a disjoint union up to measure zero. Let 



ct ■= X 



fe=i 



h _ i jk + 1 i 

s s' s s 



Obviously Cj C C^ 1 ". We claim that 

< x : x G Cj, uiGfli(i)| C < x : x G Cj, cure G Ct > . 
This is easy to see. Set x — (x±, . . . , Xd) and ux = (yi, . . . , yd). If ux G Bi(x), then 

s 

d(ux,x) = y/(xx - yi) 2 H h (x d - y d ) 2 < -■ 

In particular this means that \x^ — y^| < - for all fc G {1, . . . , d}. Assume further that 
x G Cj. Then for every k, Xk G [— , ^7—] so we must have y^ G [— — -, ^^j— + -] implying 
that ux G Ct. So we have 

<x <E X : ux <E Bi(x)\ c[J <x : x e Cj, ux e Ct\ . 
Taking measures we get 



jeJ 



m ( < x G X : cjx G -Bi (x) >) < m 



[J Ix : x eCj, ux e Ct\ 

j€J 3 ) 

- 5Z m \{ x '■ x ^ Cj, ux e Ct\j 

= V" / l c ,(i)l c +Mdm(i). 

Integrating over f2 xN we get, 

f m([xe X :L\u)xeBi(x)X\d^(u) 

< I Y/ lc-ix)l c +(L i (u)x)dm(x)d/2 
Jn^r-^Jx 3 j 

= Y\ I ^c-ix) [ l c+ {L i {u)x)d l x m {u)dm{x) 

3&J 

= J2 j tc- 3 {x)A l (l c +{x))dm{x), 



N (u) 
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where A is the averaging operator. Performing the analogous calculation as in (13.51) we get 



/, 


Ic-A' 


t c + ) dm - 


L lc ' dm J 


'> 


+ j dm 


= 


J/c 3 (A> (l c +) 


— / t n +dm I dm 
Jx C i J 




< 


i Cj (a- (i c +) - 


/ t r +dm 1 
lx > J 


1 


< 


la, 


2 


A< (l cf ) - 


- / t c +dm 

J ?c 


2 





< \\1 



0{X 



where A G (0, 1). This gives us 

l&A* (l c± ) dm< f l c dm f (t c +) dm+ \\l c \\ \\l c+ \\ 0(X*) 
x v ' J Jx Jx v J J J 

Now, recall that there were s d sub-cubes in the partition of [0, l] d so instead of summing 



over all j G J, we may multiply by s to finally get 



P(Ei 



Sjxs 



m(<xeX : L\Q)x G Bt{x) d^ M (u) = O -r + A 



4.1.3. Combining method 1 and 2. The idea of combining method 1 and method 2 is that 
for small values of i 

O ' 



is relatively small as s becomes large. Conversely, since A G (0, 1), 



1- + X 



is small for large values of i as s grows. Thus if we use the first bound for the first values 
of i and add the second bound for the last values of i we are optimizing the total upper 
bound. 

We can write the above idea as 

[s a ] K [s a ] 

i=l i=K+l 

K rfi d 2 i 



i=l 



< 



Z-^ Q d Z-^ \ c c 



8=1 



i=K+l 



+ x 
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for some feN. Since T > 1 we can estimate the first sum by 

K fd?i 



K rptfi /rpd 2 K^ 



i=l 



S u 



For the second sum we have 



E(?+*)«E(?+*)-E(p+* 

i=K+l v / »=i \ 7 i=l V 

« J- + A* 

Choose K = 5 log s where 5 > is some constant to be determined. Inserting this we get 

[.*] 1 1 

V P(Ei) < f d2 ( 51ogs ) + — — + A 51ogs 

/ j x ' gd s 

i=\ 

^ s d 2 (<51ogf)-,i + s 51ogA + s a-d^ 

The estimate as a whole must be polynomially decreasing in s, so we need all exponents 
to be negative. This is true for a — d by assumption and for <51ogA since A G (0, 1). Also, 
by choosing 5 > sufficiently small we get that d 2 (5\ogT) — d < 0. Pick 5 such that this 
inequality is satisfied and set k := min(|o; — d\, | <5 log A | , \d 2 (5 log T)—d\). We then conclude 
that 

t=l 

D 

We can now conclude on the proof of Theorem 11.11 In Lemma 14.11 we proved that the 
correct scaling sequence was u n = r + ^ log n. Lemma 14.31 and 14.21 together prove that 
Condition D'{u n ) is satisfied under the assumptions of Theorem ll.il Condition D{u n ) was 
proven already in Lemma 13.91 This means that all assumptions of Theorem 12.11 have been 
satisfied and so Theorem 11.11 follows. 

Proof of Corollary M.Sk We want to prove the logarithm law without the assumptions on 
Q and 5^ made in Lemma 14.31 hence we cannot apply Theorem 11.11 directly. However, the 
proof of Lemma F3.7I works for the random walk on the torus as well. In this case the role 
of the fc-DL assumption is played by the fact that P(£o > u n ) = -Vde~ dr which we derived 
in the proof of Lemma 14.11 

The analogue of Lemma 13.71 for closest returns on the torus implies that the conclusion 
of Theorem I3.3[ Corollary 13.101 and Corollary 13.131 holds for closest returns on the torus. 
In particular, Corollary 13.131 then implies Corollary 11.21 

□ 
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